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1 Advanced Computer Architecture 


(a) Describe why it is often important to try to limit off-chip memory accesses 


when designing a high-performance domain-specific accelerator that is part of a 
System-on-a-Chip (SoC)? [4 marks] 


What techniques or strategies might be employed to reduce off-chip memory 
accesses when designing a domain-specific accelerator? [8 marks] 


Imagine a domain-specific accelerator with numerous individual compute units. 
Each compute unit is programmable, has specialised functional units and 
produces requests to access memory. The compute units share a L1 cache. 


In order to help improve the effective bandwidth to the shared L1 cache, 
memory requests that access the same address may be coalesced (or merged). 
Experiments confirm that there is scope to coalesce requests both from a single 
unit and from different units. 


You are asked to design a hardware scheme for coalescing load requests only, 
store requests are not coalesced. The compute units generate loads and stores in 
program order. For all requests from a single compute unit, the coalescing unit 
must ensure that stores are not reordered with respect to loads or other stores 
from the same compute unit. The accelerator provides no guarantees about the 
way in which requests from different compute units may be reordered. 


The design should be efficient and scalable, i.e. make good use of any on-chip 
memory required and make some attempt to minimise the number of address 
comparisons required. It should be possible to coalesce requests that are made 
from different compute units. There is no need to discuss how data is returned 
from the L1 cache to the compute units. 


Describe your design, list any assumptions and justify your decision decisions. 
[8 marks] 
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2 Bioinformatics 


(a) 


What is the output of the UPGMA algorithm given the distance matrix of the 
species a,b, c,d,e below? 


a be ad ee 


a QO 16 -21:- 31. 20 
b 0 30 234 21 
Cc 0 28 38 
d QO 42 
€ 0 


[4 marks] 


Compute the neighbour joining phylogeny from the four species (s1,s2,s3,s4) 
DNA sequences. 


sl: GATAA 
s2: GATAC 
s3: CTTTC 
s4: CTGGG 

[4 marks] 
Compute the Burrows-Wheeler transform on the string PARALLELISM. 

[4 marks] 
Discuss the concept of modularity in the Louvain algorithm. [4 marks] 


Discuss the requirement of well-stirred chemical solution for the Gillespie 
Algorithm. [4 marks] 
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3 Business Studies 


A PhD student has made an AJ-powered chat system available on the Internet, 
through a technology preview on their departmental webpage. The student has 
attracted millions of users, and has exhausted their PhD-funded server resource. 
They have been asked by the department to remove the preview from the university’s 
servers and find a way to fund it themself. 


(a) Referencing the customer adoption curve, how should the student go about 
analysing early product use to inform their development plan for the product? 
[5 marks] 


(b) Outline a plan to enable the student to build a business around a dialogue-based 
AI chatbot, including explicit risk mitigation strategies for the 5 major risks to 
the product’s success. [15 marks] 
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4 Cryptography 


A building access-control smartcard uses the following authentication protocol. Let 
G be a generator for an elliptic-curve based cyclic group (E(Z», a,b), +) of order g. 
The card stores in its non-volatile memory a secret key u € Zj and a public card 
identifier U := uG. The door does not know u. Curve parameters were chosen such 
that determining u from curve point U is computationally infeasible. 


When the user holds the contactless card in front of the door reader, the card picks 
a number c € Z? and the door picks a number d € Zj, both uniformly at random. 
The card calculates the coordinates of elliptic-curve point C := cG. 


They then exchange the following three messages: 


card door 


read u, U from memory 


U,C 
pick c € Zi, C:=cG qd) wo) 
a pick d € Z; 
r :=(c+du) mod q eee 


What checks should the door perform on the received values U,C,r to verify 
that the card identified by U really is in possession of u? [4 marks] 


How many bits will be required to encode the values exchanged in these three 
messages in order to achieve a security level similar to the use of a 128-bit key 
in a symmetric MAC? [4 marks] 


Your colleague is concerned that the calculation of C := cG in the card slows 
down the authentication process too much, and therefore proposes to postpone 
transmission of C' to the third message, i.e. to change the three protocol messages 
from previously (U,C), d, r to now U, d, (C,r). Would this affect security? 

[5 marks] 


Due to supply-chain issues, the hardware manufacturer no longer can make door 
readers that send data to the card. Modify the original protocol such that only 
the card sends data to the door. The card maintains a counter m for how often it 
has been used, and the door remembers the highest value of m it has previously 
seen and will only open again when presented with a new value m higher than 
any seen before. Instead of receiving d in message Q), let the card calculate d 
in a way such that the card provides a digital signature of m, and sends (d,m) 
to the card. Keep message @) the same. [5 marks] 


Which value appearing in the original protocol no longer has to be transmitted 
by the unidirectional variant from Part (d), and why? [2 marks] 
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5 Denotational Semantics 


(a) Consider the following definitions: 


For L € PCF, and k EN, L Ito k if, and only if, L nq, succ*(0). 


For M € PCF yrgsna and f: NN, M IF, f if, and only if, for alli EN, 
M succ'(0) nat suce/ (0). 


For N € PCFygsnaona andg: Nx NN, N |Fo g if, and only if, for all 
i,j © N, N succ’(0) succ4(0) {nae Succ?) (0). 


Prove that N lFp g and Llko k imply NL ify Aw E N.g(k, x). [6 marks] 
Prove that there are N € PCF nai-snatsnat and a bijection g: N x N>~ N 
such that N lFy g. You may use standard results provided that you state 
them clearly. [6 marks] 
Say whether or not the following statement holds: 

For all PCF types 7 and all closed PCF terms M of type tT > tT > 7, the 
closed PCF terms fix(fn « : 7. fix(fn y: 7..M y x)) and fix(fn z: 7. M zz) 


of type 7 are contextually equivalent. [2 marks] 


Either prove or disprove the above statement. [6 marks] 
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6 Hoare Logic and Model Checking 


Consider the temporal logic CTL over atomic propositions p € AP: 
w € StateProp::= 1 | T | ad | di Ade |W Ve2/| Yi > 2|PIAGIES, 
@ € PathProp:=Xw|Fuwl|Gw| yy, U wy, 


(a) Specify the following properties as CTL formulae over AP = {p,q}. 


(i) There exists a path such that at some point p will always hold. [2 marks] 


(ii) There exists a path such that at some point q holds, and from any state 
along the path until then, a state satisfying p can be reached. [3 marks] 


Consider a temporal model M over atomic propositions AP = {p,q,r}, with 
states {1,2,3,4,5}, initial states 1 and 2, and transitions and state labelling 
as shown in the diagram (e.g. in state 1, atomic propositions p and r hold). 
Informally describe the meaning of each of the following CTL formulae over AP 
and explain why they hold in the model or give a counterexample if they do not. 


Q | l 


atq} S itp} S 24r} 


iN 


5} — 3{r} 
(i) A(r U (EXq)) [2 marks| 
(a1) (AFp) A (AGEFq) [3 marks] 
Let M be the model from (b), over atomic propositions AP = {p,q,r}, and M’ 


the model over atomic propositions AP’ = {p,r} with states 11, 12, and 13, 
initial states 13 and 12 and labelling and transitions as shown below. 


| Q 
ae i347} 2ietp. rl) S114} 
PES on. sete 


(i) Show that M’ simulates M: define a relation R and show M <*® M’. 
[6 marks] 


(i7) Is your relation R a bi-simulation? Explain why or why not. [4 marks] 
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7 Information Theory 


You are tasked with compressing a continuous textual data stream into a stream of 
binary codewords. You know the stream’s alphabet contains three letters (‘A’, ‘B’, 
‘C’) and three numbers (‘1’, ‘2’, ‘3’), and that the stream alternates between letter 
and number. You have been provided with a sample of 100 consecutive characters 
and no more as follows: 


A1A3A1C2B2 
A3B2A2B2C2 
C38C3C3C3A1 
B1B2B1C1C1 
A1C3A1A2A1 
A1A2A1A2B3 
A2B3A1A3B3 
C2A3A3C2A1 
C2C2A3A3C3 
C2A3A3B2A3 


(a) For each of the following source models find an encoding using Huffman codes 
and compare the average encoded character length to the entropy in bits per 
character. 


(7) A pure character source, ignoring the alternating nature of the characters. 
3 marks 


(77) A mixture of two distinct sources, one for letters and one for numbers. 
4 marks 


(a7) A stream of two-character symbols (‘Al’, ‘A2’, etc.). 5 marks 


(b) Explain conceptually the trend in entropy values you found in part (a). Which 
do you think is closer to the true entropy and why? 4 marks 


(c) Discuss whether it would be advantageous to model the stream as a stream of 
four-character symbols and apply Huffman coding. [2 marks] 


(d) Give two advantages to using arithmetic coding instead of Huffman coding for 
this problem. [2 marks] 
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8 Machine Learning and Bayesian Inference 


You are faced with the following simple inference problem. Examples are pairs (2, y), 
where both x and y are real numbers. You suspect that there is a relationship y = 0x 
underlying the data, and that the y values are subject to additive Gaussian noise 
with mean 0 and variance o?. You wish to infer 6 from examples (2;, y;) where 
i=1,...,m. You may consider x values to be fixed. 


(a) 


Write down an expression for the density p(Y|6; xz). You may use the standard 
expression for the Gaussian density N (1, 07) 


pl) = A exp (-5(2 -w)"). 


[1 mark] 


Write down an expression for the likelihood of the parameter 0, given the m 
training examples described. State any assumptions you make. [3 marks] 


Assuming that the parameter 0 has a Gaussian prior with mean 0 and variance 
o?, write down an expression for the posterior density p(@|y;x) of 6. You may 
leave this expression unnormalized. State any assumptions you make. 

[3 marks] 


Explain how the calculation in Part (c) might be extended to obtain the evidence, 
and how this might be used to estimate the values of o and o, from the 
training data. State any assumptions needed. You need not perform the actual 
calculation. [3 marks] 


Find an expression for the predictive density p(Y|x,x,y). You need not 
normalize the expression. You may use the identity 


b’Ab 
| exp (-3 (x"Ax + x™b+ °)) = (2r)?/?| A|-1/? exp (-3 (c- )) 
RP 


[7 marks] 


What further steps are necessary should you wish to extend your result from 
Part (e) to obtain actual predictions for Y along with a measure of certainty in 
the prediction? [3 marks] 
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9 Optimising Compilers 
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Compilers use intermediate representations (IRs) when optimising code. 


(a) For each of the IRs below, describe its merits for performing dead code 
elimination, providing a diagram or pseudo-code to illustrate your answer. 


(i) An abstract syntax tree. 


(ii) A three-address code. 


(iit) A stack-based IR. 


[4 marks] 
[4 marks] 


[4 marks] 


(b) A decompiler is given the following IR code for a function, where registers r0 
and r1 contain the function arguments and r0 contains the returned value: 


foo: mov. r2, #0 //1r2=0 
cmple r0, #0, end // Branch to end if r0O <= 0 
top: shl r0, r0O, #2 // x0 = r0 << 2 
ldr r0, [ri, rO] // r0 = MEM[ri + r0] 
cmpne rO, #5, chk // Branch to chk if rO !=5 
inc: add r2, r2, #1 //r2=y7y2+ 1 
chk: cmpgt r0, #0, top // Branch to top if r0 > 0 
end: mov fr0, r2 // x0 = r2 
ret // Return 
(7) Draw the dominance tree for this code. [2 marks] 


(2) Reconstruct a representative source code for this IR code assuming all types 
are integers or pointers. 
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[6 marks] 
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10 Principles of Communications 


(a) One simple equation for mean throughput experienced by a TCP connection is 


proportional to 
MSS 


BLL +4/p 
where MSS is the maximum segment size, or in other words, the data packet 
size; RTT is the round trip time; and p is the packet loss probability. 


Explain this equation, starting from the basic AIMD behaviour of TCP’s 
congestion control and avoidance window adjustment algorithm. Assume that 
a TCP flow is unidirectional and all data packets are maximum size, and that 
packet losses only happen due to congestion. In your answer, please explain 
any other assumptions, for example concerning the round trip time and link 
capacities on the path from sender to recipient. [10 marks} 


A clever person decides to mitigate the 1/RTT “unfairness” in the TCP 
throughput by building a scheduler that uses weighted round robin, and assigns 
weights to TCP flows, in inverse proportion to their RTT. 


Give reasons why this might not be an appropriate solution. Also give reasons 
why this might not be easy to implement. [10 marks] 
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11 Quantum Computing 


(a) 


(b) 


(1) State and prove the no-cloning principle. [2 marks] 


(77) Using standard quantum gates, construct a 2-qubit unitary circuit U that 
copies the orthogonal states |+) and |—). That is, U should have the 
properties U(|+)|0)) = |+)|4+) and U(|—)|0)) = |—)|-). [3 marks] 


An experimenter has prepared an initial 2-qubit state, and plans to measure 
the first qubit in the computational basis, and then measure the second qubit. 
For each of the following statements, identify the initial states that make it 
true. 


(1) Ifthe second qubit is measured in the computational basis, the outcome of 
the second measurement can always be predicted with certainty, once the 
outcome of the first measurement is known. 


(17) If the second qubit is measured in the computational basis, the outcome 
of the second measurement can sometimes be predicted with certainty, 
depending on the outcome of the first measurement. 


(it) Regardless of the choice of basis for the measurement of the second 
qubit, the probability of obtaining each possible outcome for the second 
measurement is independent of the outcome of the first measurement. 


[5 marks] 


The experimenter plans to adapt the scheme from (b), so that the basis for 
the second measurement can be chosen in a way that depends on the first 
measurement outcome. Is it always possible to choose a measurement basis 
such that both outcomes for the second measurement have a 50% probability? 
Justify your answer. [2 marks] 


Quantum phase estimation is to be performed for this 2-qubit unitary: 


1/V2 1/V2 0 0 


2 _ |1/v2 -1/v2 0 0 
oe ae 0 1/2 43/2 1/2-+7/2 
0 0 1/24+2/2 —1/2—7/2 


(1) Derive the eigenvectors, eigenvalues and eigenvalue phases of U, showing 
your working. [4 marks] 


(i7) If the second register is initialised in the state |00) what are the possible 
outcomes of running quantum phase estimation, and what is the probability 
with which each occurs? How many bits of precision are required for QPE 
to correctly estimate the phase for any initial state? [4 marks] 
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Randomised Algorithms 


Consider the following problem (P). We are given a directed graph G = (V, EF) with 
non-negative edge-weights w;,; > 0 for each edge (i, 7) € E. The task is to partition 
V into two sets V; and V2 = V \ V; so as to maximize the total weight of edges going 
from V; and Va, i.e., 


maximise y Wess 
(4,j)EE: i€V1 ,jJEV2 


(a) Design a randomised approximation algorithm for (P) with running time O(V) 


(b) 


and analyse its approximation ratio. [5 marks] 
In this part of the question, we additionally want that |V\| = |V2| = n/2 (we 
assume for simplicity that n = |V| is an even integer). Adjust the algorithm 


and analysis from (a). [4 marks] 


Consider the following integer program called (I): 


maximise ss Wij 2,5 
(i,j)EE 
subject to Bias Bj for each (i,j) € E 
26% Lae: for each (i,j) € E 
x; € {0,1} forie V 
25 © 0,1] for each (i,j) € E 
(i) Prove that this integer program solves the problem (P). [4 marks] 


(72) Consider the following randomised algorithm for (P). Let (Z%,%) be a 
solution to a linear program, which is identical to (I) but with x; € {0,1} 
replaced by x; € [0,1]. We then put each vertex i into Vi with probability 
1/44 %;/2, independently. 


(A) Explain briefly why this algorithm can be implemented in polynomial 
time. [1 mark] 


(B) Prove that the approximation ratio of this algorithm is 2. [6 marks] 
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13. Types 


(a) Derive the following entailments with the natural deduction system for classical 


logic. 
(i) Show =(A V B);-- 7A true. [5 marks] 
(it) Show -;=AV ABE A true. [5 marks] 


(b) (4) Using fold : Va.a—- (X ~a-— a) - Listx > a, 
cons : X — Listy — Listy and nil : Listy, write a System F function which 
appends two lists. [1 mark] 


(77) Give an OCaml data structure corresponding to the following Church 
encoding: 
Va.a> (a> X>a->a)>a 


[2 marks] 


(iii) Give a System F term which converts an element t of the type in part (iz) 
of this question into a list with the same elements. [3 marks] 


(c) Consider the following two Agda proofs: 


unitl a iS ae unitr DUA Her eat 
unitl 2 = refl(x) unitr 0 = refl(0) 
unitr (sm) = cong s (unitr 7) 
Explain why they are different. [4 marks] 


END OF PAPER 
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